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q ' Abstract 

The quantum version of communication complexity allows the two communicating parties to ex- 
change qubits and/or to make use of prior entanglement (shared EPR-pairs). Some lower bound tech- 
niques are available for qubit communication complexity [ p3[ |l9[ |l2[ g], but except for the inner product 
-^ function [ 12 1, no bounds are known for the model with unlimited prior entanglement. We show that the 

"log rank" lower bound extends to the strongest model (qubit communication + unlimited prior entan- 
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glement). By relating the rank of the communication matrix to properties of polynomials, we are able 
to derive some strong bounds for exact protocols. In particular, we prove both the "log-rank conjecture" 
and the polynomial equivalence of quantum and classical communication complexity for various classes 
s./ \ of functions. We also derive some weaker bounds for bounded-error quantum protocols. 

1 Introduction and Statement of Results 

fN| ■ Communication complexity deals with the following kind of problem. There are two separated parties, 

usually called Alice and Bob. Alice receives some input x € X, Bob receives some y £ Y, and together 
they want to compute some function f(x, y) which depends on both x and y. Alice and Bob are allowed 
infinite computational power, but communication between them is expensive and has to be minimized. How 
many bits do Alice and Bob have to exchange in the worst-case in order to be able to compute f(x, y)l This 



O^ \ model was introduced by Yao [32] and has been studied extensively, both for its applications (like lower 

Q\ ■ bounds on VLSI and circuits) and for its own sake. We refer to [|(], |l6|] for definitions and results. 

An interesting variant of the above is quantum communication complexity: suppose that Alice and 
Bob each have a quantum computer at their disposal and are allowed to exchange quantum bits (qubits) 
and/or can make use of the quantum correlations given by pre-shared EPR-pairs (these are entangled 2-qubit 
S^ states -i=(|00) + |11)) of which Alice has the first qubit and Bob the second) — can they do with fewer 

communication than in the classical case? The answer is yes. Quantum communication complexity was first 



considered by Yao |T33p and the first example where quantum beats classical communication complexity was 
given in [11]. Bigger (even exponential) gaps have been shown since [|| ||, [2^ ]. 

The question arises how big the gaps between quantum and classical can be for various (classes of) 
functions. In order to answer this, we need to exhibit limits on the power of quantum communication 
complexity, i.e. establish lower bounds — few of which are known currently. The main purpose of this 
paper is to develop tools for proving lower bounds on quantum communication protocols. We present some 
new lower bounds for the case where / is a total Boolean function. Most of our bounds apply only to exact 
quantum protocols, which always output the correct answer. However, we also have some extensions of our 
techniques to the case of bounded-error quantum protocols. 
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1.1 Lower bounds for exact protocols 

Let D(f) denote the classical deterministic communication complexity of /, Q(f) the qubit communica- 
tion complexity, and Q* (/) the qubit communication required if Alice and Bob can also make use of an 
unlimited supply of pre-shared EPR-pairs. Clearly Q*(f) < Q{f) < D(f). Ultimately, we would like to 
show that Q*(f) and D{f) are polynomially related for all total functions / (as are their query complexity 
counterparts [H]). This requires stronger lower bound tools than we have at present. Some lower bound 
methods are available for Q(f) p3| , 19, 12, ||], but the only lower bound known for Q*(f) is for the inner 



product function [jTJ]. A strong and well known lower bound for D(f) is given by the logarithm of the 
rank of the communication matrix for / [p2~|]. As first noted in [||], techniques of [33, [19|] imply that an 



£l(\ogrank(f))-bo\md also holds for Q(f). Our first result is to extend this bound to Q*(f) and to derive 
the optimal constant: 

q*(/) > l ^in. CD 

This implies n/2 lower bounds for the Q* -complexity of the equality and disjointness problems, for which 
no good bounds were known before. This n/2 is tight up to 1 bit, since Alice can send her n-bit input to 
Bob with n/2 qubits and n/2 EPR-pairs using superdense coding [gj. Our corresponding lower bound also 
provides a new proof of optimality of superdense coding. In fact, the same n/2 bound holds for almost all 
functions. Furthermore, proof of the well-known "log-rank conjecture" (D(f) < (\ogrank(f)) k for some 
k) would now imply our desired polynomial equivalence between D(f) and Q*(f) (as already noted for 
D(f) and Q(f) in [§]). However, this conjecture is a long standing open question which is probably hard to 
solve in full generality. 

Secondly, in order to get an algebraic handle on rank(f), we relate it to a property of polynomials. 
It is well known that every total Boolean function g : {0,l} n — ► {0,1} has a unique representation as 
a multilinear polynomial in its n variables. For the case where Alice and Bob's function has the form 
f(x,y) = g(x A y), we show that rank(f) equals the number of monomials mon(g) of the polynomial 
that represents g (rank(f) < mon(g) was shown in [J281]). This number of monomials is often easy to 
count and allows to determine rank(f). The functions f(x, y) = g(x A y) form an important class which 
includes inner product, disjointness, and the functions which give the biggest gaps known between D(f) 



and log rank(f) [ |28| ] (similar techniques work for f(x, y) = g(x V y) or g{x ® y)). 

We use this to show that Q*(f) G Q(D(f)) if g is symmetric. In this case we also show that D(f) is 
close to the classical randomized complexity. Furthermore, Q*(f) < D(f) £ 0(Q*(f) 2 ) if g is monotone. 
For the latter result we rederive a result of Lovasz and Saks pA\ \ using our tools. 

1.2 Lower bounds for bounded-error protocols 

For the case of bounded-error quantum communication protocols, very few lower bounds are currently 
known (exceptions are inner product [12] and the general discrepancy bound JI^j). In particular, no good 



lower bounds are known for the disjointness problem. The best known upper bound for this is 0{y/n\og n) 



qubits []8fl, contrasting with linear classical randomized complexity []17|, [Mj. Since disjointness is a co-NP- 
complete communication problem [J3j] , a good lower bound for this problem would imply lower bounds for 
all NP-hard communication problems. 

In order to attack this problem, we make an effort to extend the above polynomial-based approach 
to bounded-error protocols. We consider the approximate rank rank(f), and show the bound Q2H) > 



(log rank{f))/ 2 for 2-sided bounded-error qubit protocols (again using techniques from [33, 19|]). Unfor- 
tunately, lower bounds on rank(f) are much harder to obtain than for rank(f). If we could prove for the 
case f(x, y) = g(xAy) that rank(f) roughly equals the number of monomials mon(g) of an approximating 
polynomial for g, then a y/n lower bound would follow for disjointness, because we show that this requires 



at least 2^™ monomials to approximate. Since we prove that the quantities rank(f) and mon(g) are in fact 
equal in the exact case, this gives some hope for a similar result rank(f) « mon(g) in the approximating 
case, and hence for resolving the complexity of disjointness. 

The specific bounds that we actually were able to prove for disjointness are more limited at this point: 



Q^DISJ) G fi(log n) for the general case (by an extension of techniques of [120; the log n bound without 
entanglement was already known [Q]), (^(DISJ) G O(n) for 1-round protocols (using a result of [p4]]), and 
Q 2 (DISJ) G O(n) if the error probability has to be < 2 _n . 

Below we sum up the main results, contrasting the exact and bounded-error case. 

• We show that Q*(f) > logrank(f)/2 for exact protocols with unlimited prior EPR-pairs and 
Q2C/O > logrank(f)/2 for qubit protocols without prior EPR-pairs. 

• If /(x, y) = y(x A y) for some Boolean function g, then rank(f) = mon(g). An analogous result 
rank(f) ~ rnoh(g) for the approximate case is open. 

• A polynomial for disjointness, DISJ(x, y) = NOR(x A y), requires 2 n monomials in the exact case 
(implying Q*(DISJ) > n/2), and roughly 2^™ monomials in the approximate case. 

2 Preliminaries 

We use |x| to denote the Hamming weight (number of Is) of x G {0, 1}™, Xj for the 2th bit of x (xq = 0), 
and ej for the string whose only 1 occurs at position i. If x, y G {0, l} n , we use x Ay G {0, l} n for the string 
obtained by bitwise ANDing x and y, and similarly x\l y. Let y : {0, l} n — » {0, 1} be a Boolean function. 
We call g symmetric if g(x) only depends on \x\, and monotone if y cannot decrease if we set more variables 
to 1. It is well known that each g : {0,l} n — ► R has a unique representation as a multilinear polynomial 
g(%) = 2~^sc{i n} a sXs, where X$ is the product of the variables in S and as is a real number. The term 
asXs is called a monomial of g and mon(g) denotes the number of non-zero monomials of g. A polynomial 
p approximates g if \g{x) — p(x)\ < 1/3 for all x G {0, l} n . We use rnoh(g) for the minimal number of 
monomials among all polynomials which approximate g. The degree of a monomial is the number of its 
variables, and the degree of a polynomial is the largest degree of its monomials. 

Let X and Y be finite sets (usually X = Y = {0, l} n ) and / : X x Y -» {0, 1} be a Boolean 
function. For example, equality has EQ(x, y) = 1 iff x = y, disjointness has DISJ(x, y) = 1 iff |x A y| = 
(equivalently, DISJ(x, y) = NOR(x A y)), and inner product has IP(x, y) = 1 iff |x A y| is odd. Mf denotes 
the \X\ x \Y\ Boolean matrix whose x,y entry is f(x,y), and rank(f) denotes the rank of Mf over the 
reals. A rectangle is a subset iS = 5xTCIxYof the domain of /. A 1 -cover for / is a set of rectangles 
which covers all and only Is in Mf. C 1 (f) denotes the minimal size of a 1 -cover for /. For rn > 1, 
we use f Am to denote the Boolean function which is the AND of m independent instances of /. That is, 
/ Am :X m xy m ^{0,l}and/ Am (x 1 ,...,x m ,y 1 ,...,y m ) = /(x 1 ,y 1 )A/(x 2 ,y 2 )A...A/(x m ,y m ). 
Note that Mf A2 is the Kronecker product Mf (g> Mf and hence rank(f Am ) = rank(f) m . 

Alice and Bob want to compute some f : X x Y —> {0,1}. After the protocol they should both know 
/(x,y). Their system has three parts: Alice's part, the 1-qubit channel, and Bob's part. For definitions 
of quantum states and operations, we refer to |Q, |Kj]. In the initial state, Alice and Bob share k EPR- 
pairs and all other qubits are zero. For simplicity we assume Alice and Bob send 1 qubit in turn, and 
at the end the output-bit of the protocol is put on the channel. The assumption that 1 qubit is sent per 
round can be replaced by a fixed number of qubits qi for the ith round. However, in order to be able to 
run a quantum protocol on a superposition of inputs, it is important that the number of qubits sent in the 
fth round is independent of the input (x, y). An ^-qubit protocol is described by unitary transformations 



Ui(x), U2(y), Us(x), U±(y), ... , Ue(x/y). First Alice applies U\{x) to her part and the channel, then Bob 
applies £72 (y) to his part and the channel, etc. 

Q(f) denotes the (worst-case) cost of an optimal qubit protocol that computes / exactly without prior 
entanglement, C*(f) denotes the cost of a protocol that communicates classical bits but can make use of 
an unlimited (but finite) number of shared EPR-pairs, and Q*(f) is the cost of a qubit protocol that can 
use shared EPR-pairs. Q c {f) denotes the cost of a clean qubit protocol without prior entanglement, i.e. a 
protocol that starts with |0)|0)|0) and ends with |0)|/(x,y))|0). We add the superscript "1 round" for 1- 
round protocols, where Alice sends a message to Bob and Bob then sends the output bit. Some simple 
relations that hold between these measures are Q*(f) < Q{f) < D(f) < D lround {f), Q(f) < Q c (f) < 
2Q(f) and Q*(f) < C*(f) < 2Q*(f) [§]. For bounded-error protocols we analogously define Q 2 (f), 
Q%(f), C%(f) for quantum protocols that give the correct answer with probability at least 2/3 on every 
input. We use R^ 1 (/) for the classical bounded-error complexity in the public-coin model [f20|]. 

3 Log-Rank Lower Bound 



As first noted in [||, ||], techniques from [[33|, 19] imply Q(f) G Q(log rank(f)). For completeness we 
prove the following logrank(f) bound for clean quantum protocols in Appendix [A|. This implies Q(f) > 
log rank(f)/2. We then extend this to the case where Alice and Bob share prior entanglement :[] 

Theorem 1 Q c (f) > logrank(f) + 1. 

Theorem! Q*(/)> l0gra ; fe(/) . 

Proof Suppose we have some exact protocol for / that uses £ qubits of communication and k prior EPR- 
pairs. We will build a clean qubit protocol without prior entanglement for f Am . First Alice makes k 
EPR-pairs and sends one half of each pair to Bob (at a cost of k qubits of communication). Now they run the 
protocol to compute the first instance of / (£ qubits of communication). Alice copies the answer to a safe 
place which we will call the 'answer bit' and they reverse the protocol (again £ qubits of communication). 
This gives them back the k EPR-pairs, which they can reuse. Now they compute the second instance of /, 
Alice ANDs the answer into the answer bit (which can be done cleanly), and they reverse the protocol, etc. 
After all m instances of / have been computed, Alice and Bob have the answer f Am (x, y) left and the k 
EPR-pairs, which they uncompute using another k qubits of communication. 

This gives a clean protocol for f Am that uses 2m£+2k qubits and no prior entanglement. By Theorem []} 

2m£ + 2k> Q c {f Am ) > log rank(f Am ) + 1 = m log rank(f) + 1, 

hence 

logrank(f) 2k — 1 

~ 2 2m 

Since this must hold for every m > 0, the theorem follows. □ 

We can derive a stronger bound for C*(/): 
Theorem 3 C*(f) > log rank(f). 

'During discussions we had with Michael Nielsen in Cambridge in the summer of 1999, it appeared that an equivalent result 
can be derived from results about Schmidt numbers in [E5l Section 6.4.2]. 



Proof Since a qubit and an EPR-pair can be used to send 2 classical bits |g], we can devise a qubit protocol 
for / A / using C* (/) qubits (compute the two copies of / in parallel using the classical bit protocol). Hence 
by the previous theorem C*(f) >Q*(f A /) > (logrank(f A f))/2 = logrank(f). □ 

Below we draw some consequences from these log-rank lower bounds. Firstly, Meq is the identity 
matrix, so rank(EQ) = 2". This gives the bounds Q*{EQ) > n/2, C*(EQ) > n (in contrast, C? 2 (EQ) G 
G(logn) and C^EQ) G 0(1))- The disjointness function on n bits is the AND of n disjointnesses on 1 
bit (which have rank 2 each), so rank(DlSJ) = 2 n . The complement of the inner product function has 
rank(f) = 2 n . Thus we have the following strong lower bounds, all tight up to 1 bit:0 

Corollary 1 Q* (EQ) , Q* (DISJ) , Q* (IP) > n/2 and C* (EQ) , C* (DISJ) , C* (IP) > n. 

Komlos [|TJ] has shown that the fraction ofmxm Boolean matrices that have determinant goes to 
as m —>■ oo. Hence almost all 2 n x 2 n Boolean matrices have full rank 2 n , which implies that almost all 
functions have maximal quantum communication complexity: 

Corollary 2 Almost all f : {0, l} n x {0, l} n -► {0, 1} have Q*(f) > n/2 and C*(f) > n. 

We say / satisfies the quantum direct sum property if computing m independent copies of / (without 
prior entanglement) takes mQ(f) qubits of communication in the worst case. (We have no example of an / 
without this property.) Using the same technique as before, we can prove an equivalence between the qubit 
models with and without prior entanglement for such /: 

Corollary 3 If f satisfies the quantum direct sum property, then Q*(f) < Q(f) < 2Q*(f). 

Proof Q*(f) < Q(f) is obvious. Using the techniques of Theorem ^| we have mQ{f) < 2mQ*(f) + k, 
for all m and some fixed k, hence Q(f) < 2Q*(f). □ 

Finally, because of Theorem ||, the well-known "log-rank conjecture" now implies the polynomial equiv- 
alence of deterministic classical communication complexity and exact quantum communication complexity 
(with or without prior entanglement) for all total /: 

Corollary 4 IfD(f) G O ((log rank{f)) k ), thenQ*(f) < Q(f) < D(f) G 0(Q*(f) k )for all f. 

4 A Lower Bound Technique via Polynomials 

4.1 Decompositions and polynomials 

The previous section showed that lower bounds on rank(f) imply lower bounds on Q*(f). In this section 
we relate rank(f) to the number of monomials of a polynomial for / and use this to prove lower bounds 
for some classes of functions. 

We define the decomposition number m(f) of some function / : {0, l} n x {0, l} n — > R as the min- 
imum m such that there exist functions a\(x), . . . , a m (x) and b\(y), . . . , b m (y) (from R n to R) for which 
f{x, y) = J2i^=i ck(%)bi(y) for all x, y. We say that / can be decomposed into the m functions aibi. Without 
loss of generality, the functions Oj , bi may be assumed to be multilinear polynomials. It turns out that the 
decomposition number equals the rank:0 

2 The same bounds for IP are also given in [jl2[|. The bounds for EQ and DISJ are new, and can also be shown to hold for 
zero-error quantum protocols. 

3 The first part of the proof employs a technique of Nisan and Wigderson 128]. They used this to prove log rank(f) £ O(n los3 2 ) 
for a specific /. Our Corollary n below implies that this is tight: log rank(f) € 0(n og3 ) for their /. 



Lemma 1 rank(f) = m(f). 

Proof 

rank(f) < m(f ): Let f(x, y) = YaLi <H(x)h(y), Mi be the matrix defined by Mj(x, y) = a,i(x)bi{y), 
Ti be the row vector whose yth entry is 6j(y). Note that the xth row of Mi is a,(x) times n. Thus all rows 
of Mi are scalar multiples of each other, hence Mi has rank 1. Since rank(A + B) < rank(A) + rank(B) 

and M f = YT=i m ^ we have rank(f) = rank(M f ) < J2T=P rank{Mi) = m(f). 

m(f) < rank(f): Suppose rank(f) = r. Then there are r columns c±, . . . ,c r in Mf which span 
the column space of Mf. Let A be the 2"xr matrix that has these c, as columns. Let B be the rx2" 
matrix whose ith column is formed by the r coefficients of the ith column of Mf when written out as a 
linear combination of ci, . . . , c r . Then Mf = AB, hence f(x, y) = Mf(x, y) = Ya=i A x iBi y . Defining 
functions a-i, bi by ai{x) = A x i and bi(y) = Bi y , we have m(f) < rank(f). □ 

Combined with Theorems || and ^] we obtain 
Corollary 5 Q*(f) > bg ^ (/) andC*(f) > logm(f). 

Accordingly, for lower bounds on quantum communication complexity it is important to be able to 
determine the decomposition number m(f). Often this is hard. It is much easier to determine the number 
of monomials mon(f) of / (which upper bounds m(/)). Below we show that in the special case where 
f(x, y) = g(x A y), these two numbers are the same.[] 

Below, a monomial is called even if it contains Xi iff it contains yi, for example 2x\Xzy\yz is even and 
x\x^y\ is not. A polynomial is even if each of its monomials is even. 

Lemma 2 Ifp : {0, l} n x {0, l} n — ► R is an even polynomial with k monomials, then m(p) = k. 

Proof Clearly m(p) < k. To prove the converse, consider DISJ(x,y) = ITf =1 (l — xiyi), the unique 
polynomial for the disjointness function. Note that this polynomial contains all and only even monomials 
(with coefficients ±1). Since DISJ has rank 2 n , it follows from Lemma |l| that DISJ cannot be decomposed 
in fewer then 2™ terms. We will show how a decomposition of p with m(p) < k would give rise to a 
decomposition of DISJ with fewer than 2™ terms. Suppose we can write 

m(p) 

V{x,y) = Yl ai(x)bi(y). 

i=l 

Let aXsYs be some even monomial in p and suppose the monomial XsYs in DISJ has coefficient c = ±1. 
Now whenever bX$ occurs in some Oj, replace that bX$ by (cb/a)Xg. Using the fact that p contains only 
even monomials, it is not hard to see that the new polynomial obtained in this way is the same as p, except 
that the monomial aX$Ys is replaced by cXgYs- 

Doing this sequentially for all monomials in p, we end up with a polynomial p' (with k monomials and 
m(p') < m{p)) which is a subpolynomial of DISJ, in the sense that each monomial in p' also occurs with 
the same coefficient in DISJ. Notice that by adding all 2 n — k missing DISJ-monomials to p' , we obtain a 
decomposition of DISJ with m(p') + 2 n — k terms. But any such decomposition needs at least 2 n terms, 
hence m(p') + 2 n — k > 2™, which implies k < m(p') < m(p). □ 

If f(x, y) = g(x A y) for some Boolean function g, then the polynomial that represents / is just the 
polynomial of g with the ith variable replaced by Xiy%. Hence such a polynomial is even, and we obtain: 

4 After learning about this result, Mario Szegedy (personal communication) came up with an alternative proof of this, using 
Fourier transforms. 



Corollary 6 If g : {0, l} n — ► {0, 1} and f(x, y) = g(xAy), then mon(g) = mon(f) = m(f) = rank(f). 

This gives a strong tool for lower bounding (quantum and classical) communication complexity when- 
ever / is of the form f(x, y) = g(x A y): logmon(g) < C*(f) < D(f). Below we give some applications. 

4.2 Symmetric functions 

As a first application we show that D(f) and Q*(f) are linearly related if f(x,y) = g(x A y) and g is 
symmetric (this follows from Corollary || below). Furthermore, we show that the classical randomized 
public-coin complexity R^ 1 (/) can be at most a logn-factor less than D(f) for such / (Theorem ^). We 
will assume without loss of generality that g(0) = 0, so the polynomial representing g does not have the 
constant- 1 monomial. 

Lemma 3 If g is a symmetric function whose lowest-weight 1 -input has Hamming weight t > and 
f{x, y) = g(x A y), then D lround (f) = log (£? =4 (?) + 1) + 1. 

Proof It is known (and easy to see) that D lround (f^ = \ Q g r + 1, where r is the number of different rows 
of Mf (this equals the number of different columns in our case, because f(x, y) = f(y, x)). We count r. 
Firstly, if \x\ < t then the a; -row contains only zeroes. Secondly, if x ^ x' and both |x| > t and \x'\ > t 
then it is easy to see that there exists a y such that \x A y\ = t and \x' A y\ < t (or vice versa), hence 
f[x, y) / f(x', y) so the x-row and x'-row are different. Accordingly, r equals the number of different x 
with \x\ > t, +1 for the 0-row, which gives the lemma. □ 



Lemma 4 If g is a symmetric function whose lowest-weight 1 -input has weight t > 0, then 
(1 " o(l)) log (EL* ( •)) < logmon(g) < log (£?=* Q)) . 

Proof The upper bound follows from the fact that g cannot have monomials of degree < t. For the lower 
bound we distinguish two cases. 

Case 1: t < n/2. It is known that every symmetric g has degree deg(g) = n — 0(n 0,548 ) [14]. That is, 
an interval / = [a, n] such that g has no monomials of any degree d £ / has length at most O(n 0548 ). This 
implies that every interval I = [a, b] (b > t) such that g has no monomials of any degree del has length at 
most 0(n 0,548 ) (by setting n — b variables to 0, we can reduce to a function on b variables where / occurs 
"at the end"). Since g must have monomials of degree t < n/2, g must contain a monomial of degree d for 
some d £ [n/2, n/2 + O(n 0548 )]. But because g is symmetric, it must then contain all Q) monomials of 
degree d. Hence by Stirling's approximation mon(g) > Q) > 2 n ~°^ n ' \ which implies the lemma. 

Case 2: t > n/2. It is easy to see that g must contain all (") monomials of degree t. Now 



n \ s-^ In 



(n-t+ l)mon(g) > (n - t + 1) > ^ 

Hence log mon(g) > log (£? =t (")) - log(n - t + 1) = (1 - o(l)) log (£? =t (?)). D 

The number mon(g) may be less then £™ =t ( n ). Consider the function g(xi, X2, X3) = x\ + 3:2 + 23 — 
X1X2 — X1X3 — X2X3 jJ27|]. Here mon(g) = 6 but £f =1 ( ) = 7. Hence the 1 — o(l) of Lemma || cannot be 
improved to 1 in general (it can if g is a threshold function). 

Combining the previous results: 



Corollary 7 If g is a symmetric function whose lowest-weight 1 -input has weight t > and f(x,y) = 
g(x A y), then (1 - o(l)) log (£? =4 (•)) < <?*(/) < D(f) < D^ nd (f) = log (E?=t (?) + l) + 1- 

Accordingly, for symmetric g the communication complexity (quantum and classical, with or without 
prior entanglement, 1-round and multi-round) equals log rank(f) up to small constant factors. In particular: 

Corollary 8 If g is symmetric and f(x, y) = g(x A y), then (1 - o(l))D(f) < C*(f) < D(f). 

We have shown that Q*(f) and D(f) are equal up to constant factors whenever f{x,y) = g(x A y) 
and g is symmetric. For such /, D(f) is also nearly equal to the classical bounded-error communication 
complexity R% u (/), where we allow Alice and Bob to share public coin flips. In order to prove this, we 
introduce the notion of O-block sensitivity in analogy to the notion of block sensitivity of Nisan p6|]. For 
input x G {0, l} n , let bsO x .(g) be the maximal number of disjoint sets S\, . . . , <S& of indices of variables, 
such that for every i we have (1) all S^-variables have value in x and (2) g(x) ^ g(x Si ), where x Si is the 
string obtained from x by setting all S^ -variables to 1. Let bsO(g) = max x bsO x (g). We now have: 

Lemma 5 If g is a symmetric function, then mon(g) < n 2hs0 ( g >. 

Proof Let t be the smallest number such that g t ^ gt+i, then bsO(g) > n—t. lit < n/2 then bsO(g) > n/2, 
so mon(g) < 2 n < n 2hti0( - 9 \ If t > n/2 then g has no monomials of degree < t, hence mon(g) < 

E?=mO<™ 2bs%) - ° 

Theorem 4 If g is a symmetric function and f(x, y) = g(x A y), then D(f) € 0{Bf^ (/) logn). 

Proof By Corollary [7] we have D(f) < (l + o(l)) log mon(g). Lemma| implies D(f) G O(bs0(g) logn). 
Using Razborov's lower bound technique for disjointness [ p0[ ] (see also p0| , Section 4.6]) we can easily 
show i?^" (/) £ ^(bs0(/)), which implies the theorem. □ 

This theorem is tight for the function defined by g{x) = 1 iff \x\ > n — 1. We have mon(g) = n + 1, so 
log n < D(f) < (1 + o(l)) log n. On the other hand, an 0(1) bounded-error public coin protocol can easily 
be derived from the well-known 0(l)-protocol for equality: Alice tests if |x| < n — 1, sends a if so and a 
1 if not. In the first case Alice and Bob know that f(x, y) = 0. In the second case, we have f(x, y) = 1 iff 
x = y or y = 1, which can be tested with 2 applications of the equality-protocol. Hence R^ 11 (/) G O(l). 

4.3 Monotone functions 

A second application concerns monotone problems. Lovasz and Saks [^l|] prove the log-rank conjecture for 
(among others) the following problem, which they call the union problem for C. Here C is a monotone set 
system (i.e. (A £ C A A C B) ^> B G C) over some size-n universe. Alice and Bob receive sets x and y 
(respectively) from this universe, and their task is to determine whether xUy G C. Identifying sets with their 
representation as n-bit strings, this problem can equivalently be viewed as a function /(x, y) = g{x V y), 
where g is a monotone increasing Boolean function. Note that it doesn't really matter whether we take g 
increasing or decreasing, nor whether we use x V y or x A y, as these problems can all be converted into each 
other via De Morgan's laws. Our translation of rank to number of monomials now allows us to rederive the 
Lovasz-Saks result without making use of their combinatorial lattice theoretical machinery. We just need 
the following, slightly modified, result from their paper (a proof is given in Appendix |B]): 

Theorem 5 (Lovasz and Saks) D(f) < (1 + log(C 1 (/) + 1))(2 + log rank(f)). 



Theorem 6 (Lovasz and Saks) Ifg is monotone and f(x, y) = g(xAy), then D(f) G 0((logrank(f)) 2 ). 

Proof Let Mi, . . . , M& be all the minimal monomials in g. Each M, induces a rectangle Ri = Si x Tj, 
where S^ = {x | Mj C x} and Tj = {y \ Mi C y}. Because 5 is monotone increasing, g(z) = 1 iff 2 
makes at least one Mi true. Hence f(x,y) = 1 iff there is an i such that (x, y) G Ri. Accordingly, the set 
of Ri is a 1-cover for / and C 1 (/) < k < mon(g) = rank(f) by Corollary ||. Plugging into Theorem || 
gives the theorem. □ 



Corollary 9 Ifg is monotone and f(x, y) = g(x A y), then D(f) G 0(Q*(f) 2 ). 

This result can be tightened for the special case of d-leve.1 AND-OR-trees. For example, let g be a 2-level 
AND-of-ORs on n variables with fan-out yjn and f(x, y) = g(x A y). Then g has (2^ — l)v™ monomials 
and hence Q*(f) > n/2. In contrast, the zero-error quantum complexity of / is 0(n 3 ' 4 logra) [Q]. 

5 Bounded-Error Protocols 

Here we generalize the above approach to bounded-error quantum protocols. Define the approximate 
rank of /, rank(f), as the minimum rank among all matrices M that approximate My entry-wise up to 
1/3. Let the approximate decomposition number fh(f) be the minimum m such that there exist functions 
ai(x),...,a m (x) and 61 (y),... ,b m (y) for which \f(x,y) - YZLi (h(x)bi(y)\ < 1/3 for all x, y. By the 
same proof as for Lemma [J] we obtain: 

Lemma 6 rank(f) = fh(f). 

By a proof similar to Theorem |] (again using methods from [33, 19], see Appendix |C]) we show 

logm(/) 



Theorem 7 Q 2 (f) > 

Unfortunately, it is much harder to prove bounds on rfi(f) than on m(/).[] In the exact case we have 
m(f) = mon(g) whenever f(x, y) = g(xAy), and mon(g) is often easy to determine. If something similar 
is true in the approximate case, then we obtain strong lower bounds on Q2(f), because our next theorem 
gives a bound on mon{g) in terms of the 0-block sensitivity defined in the previous section (the proof is 
deferred to Appendix |D|). 

Theorem 8 If g is a Boolean function, then rnon(g) > 2V bs0 (s)/ 12 . 

In particular, for DIS J(x, y) = NOR(xAy) it is easy to see that bsO(NOR) = n, hence log mon(NOR) > 
\JnjYl (the upper bound logmon(NOR) G 0(^fn\ogn) follows from the construction of a degree-y^ 
polynomial for OR in []27|]). Consequently, a proof that the approximate decomposition number fh(f) 
roughly equals mon(g) would give Q2(DISJ) G Q(^/n), nearly matching the 0(yJn\ogn) upper bound 
of ^. Since m(f) = mon(g) in the exact case, a result like m(f) ~ moh(g) might be doable. 

We end this section by proving some weaker lower bounds for disjointness. Firstly, disjointness has 
a bounded-error protocol with 0(y/n log n) qubits and 0(y / n) rounds [§], but if we restrict to 1-round 
protocols then a linear lower bound follows from a result of Nayak [^4J] : 

5 It is interesting to note that IP (the negation of IP) has less than maximal approximate decomposition number. For example for 
n = 2, m(f) = 4 but m(/) = 3. 



Theorem 9 Q£ round (DISJ) E n(n). 

Proof Suppose there exists a 1-round qubit protocol with m qubits: Alice sends a message M(x) of m 
qubits to Bob, and Bob then has sufficient information to establish whether Alice's x and Bob's y are 
disjoint. Note that M(x) is independent of y. If Bob's input is y = ej, then DISJ(x, y) is the negation of 
Alice's zth bit. But then the message is an (n, m, 2/3) quantum random access code [jl|]: by choosing input 
y = ei and continuing the protocol, Bob can extract from M(x) the ith bit of Alice (with probability > 2/3), 
for any 1 < i < n of his choice. For this the lower bound m > (1 — H(2/3))n > 0.08 n is known [g4j]. □ 



For multi-round quantum protocols for disjointness with bounded error probability we can only prove a 
logarithmic lower bound, using a technique from [ JT~2[ ] (we omit the proof for reasons of space; for the model 
without entanglement, the bound Q2(DISJ) E fi(logn) was already shown in [Q]). 

Proposition 1 Q^(DISJ) E ft(logn). 

Finally, for the case where we want to compute disjointness with very small error probability, we can 
prove an Q(n) bound. Here we use the subscript "e" to indicate qubit protocols (without prior entanglement) 
whose error probability is < e. We first give a bound for equality: 

Theorem 10 Ife < 2~ n , then Q e (EQ) > n/2. 

Proof By Lemma |6| and Theorem [7|, it suffices to show that an ^-approximation of the 2 n x 2 n identity 
matrix / requires full rank. Suppose that M approximates / entry- wise up to e but has rank < 2 n . Then M 



has some eigenvalue A = 0. Gersgorin's Disc Theorem (see [15, p. 31]) implies that all eigenvalues of M 
are in the set [Ji{z \ \z — Ma\ < Ri}, where Ri = J2j+i \Mij\. But if A = is in this set, then for some i 

1 - e < \Mu\ = |A - Mu\ <Ri< (2 n - l)e, hence e > 2~ n , contradiction. □ 

We reduce equality to disjointness. Let x, y E {0, l} n . Define x' E {0, l} 2n by replacing Xi by xixl in 
x, and y' E {0, l} 2 ™ by replacing j/j by ~yiVi m V- It is easy to see that EQ(x, y) = DISJ(x', y') so we have: 

Corollary 10 Ife < 2~ n , then Q E (DISJ) > n/4. 

6 Open Problems 

To end this paper, we identify three important open questions in quantum communication complexity. First, 
are Q* (/) and D(f) polynomially related for all total /, or at least for all / of the form f(x, y) = g(x A y)l 
We have proven this for some special cases here (g symmetric or monotone), but the general question 
remains open. There is a close analogy between the quantum communication complexity lower bounds 
presented here, and the quantum query complexity bounds obtained in [fin. Let deg(g) and mon(g) be, re- 
spectively, the degree and the number of monomials of the polynomial that represents g : {0, l} n — > {0, 1}. 
In [Q] it was shown that a quantum computer needs at least deg{g)/2 queries to the n variables to com- 
pute g, and that 0(deg(g) A ) queries suffice (see also p7|]). This implies that classical and quantum query 
complexity are polynomially related for all total /. Similarly, we have shown here that (log mon(g))/ '2 
qubits need to be communicated to compute f(x, y) = g{x A y). An analogous upper bound like Q*(f) E 
O '((log mon(g)) k ) might be true. A similar resemblance holds in the bounded-error case. Let deg(g) be 
the minimum degree of polynomials that approximate g. In [Q] it was shown that a bounded-error quan- 
tum computer needs at least deg(g)/2 queries to compute g and that 0(deg(g) 6 ) queries suffice. Here we 
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showed that (log m(/))/2 qubits of communication are necessary to compute /. A similar upper bound like 
Q 2 (/)GO((logm(/)) fc ) may hold. 

A second open question: how do we prove good lower bounds on bounded-error quantum protocols? 
Theorems ^ and || of the previous section show that Qz(f) is lower bounded by logm(/)/2 and log mon(g) 
is lower bounded by ^/bsO(<?). If we could show fh(f) pss rnoh(g) whenever f(x, y) = g(x A y), we would 
have Q2C/O £ fi(\/bsO(g)). Since m(f) = mon(g) in the exact case, this may well be true. As mentioned 
above, this is particularly interesting because it would give a near-optimal lower bound Q2(DISJ) G Q,(yfn). 

Third and last, does prior entanglement add much power to qubit communication, or are Q(f) and 
Q* (/) roughly equal up to small additive or multiplicative factors? Similarly, are Q2H) and Q* 2 {f) roughly 
equal? The biggest gap that we know is Q2(EQ) G G(logn) versus <5|(EQ) G 0(1). 

Acknowledgments. We acknowledge helpful discussions with Alain Tapp, who first came up with the idea 
of reusing entanglement used in Section ||. We also thank Michael Nielsen, Mario Szegedy, Barbara Terhal 
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A Proof of Theorem |I| 

Here we prove a log rank(f) lower bound for clean qubit protocols. 

Lemma 7 (Kremer/Yao) The final state of an t-qubit protocol (without prior entanglement) on input (x, y) 
can be written as 

J2 <Xi(x)Pi(y)\Mx))\u)\Bi(v)), 

where the oti{x) , f3i(y) are complex numbers and the Ai(x), Bi(y) are unit vectors. 

Proof The proof is by induction on £: 

Base step. For £ = the lemma is obvious. 

Induction step. Suppose after £ qubits of communication the state can be written as 

£ atWPiWIAiixWilWv)). (2) 

i&{o,iy 

We assume without loss of generality that it is Alice's turn: she applies Uc + \ (x) to her part and the channel. 
Note that there exist complex numbers ctio(x), an(x) and unit vectors Aio(x), An(x) such that 

(U t +i(x) ® I)\A(x))\ie)\Bi(y)) = a i0 (x)|^o(x))|0)|^(y)) + a^lAi^))!!)!^)). 

Thus every element of the superposition @ "splits in two" when we apply Ui + \. Accordingly, we can write 
the state after Ue+i in the form required by the lemma. □ 

Theorem | Q c (f) > logrank(f) + 1. 

Proof Consider a clean £-qubit protocol for /. By Lemma [7], we can write its final state as 

J2 ai(x)Pi(y)\Mx))\u)\Bi(v))' 

ie{o,iy 

The protocol is clean, so the final state is |0)|/(x, y))|0). Hence all parts of \Ai(x)} and \Bi(y)) other than 
|0) will cancel out, and we can assume without loss of generality that \Ai(x)) = \Bi(y)) = |0) for all i. 
Now the amplitude of the |0)|1)|0) -state is simply the sum of the amplitudes a.i{x)f3i(y) of the i for which 
ii = 1. This sum is either or 1, and is the acceptance probability P(x, y) of the protocol. Letting a(x) 
(resp. (3(y)) be the dimension-2^ 1 vector whose entries are oti{x) (resp. 0i(y)) for the i with ig = 1: 

P{x,y)= Yl a i( x )Pi(y) = a ( x ) T ■/%)• 

i:if=\ 

Since the protocol is exact, we must have P(x, y) = f(x, y). Hence if we define A as the \X\ x d matrix 
having the a(x) as rows and B as the d x \Y\ matrix having the (3(y) as columns, then Mf = AB. But now 



rank 



k(Mf) = rank(AB) < rank(A) < d < 2 , and the theorem follows. □ 
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B Proof of Theorem g 

Theorem | (Lovasz and Saks) £>(/) < (1 + log(C 1 (/) + 1))( 2 + logronfc(/)). 

Proof We will first give a protocol based on a 0-cover. Let c = C°(/) and i?i, . . . , R c be an optimal 
0-cover. Let R4 = Si x Tj. We will also use Si to denote the \Si\ X 2 n matrix of 5,-rows and Tj for the 
2 n x |Tj| matrix of Tj-columns. Call Ri type 1 if rank(Si) < rank(Mj)/2, and type 2 otherwise. Note 
that rank(Si) + rank(Ti) < rank(Mf), hence at least one of rank(Si) and rank(Ti) is < rank(Mf)/2. 

The protocol is specified recursively as follows. Alice checks if her x occurs in some type 1 Ri. If no, 
then she sends a to Bob; if yes, then she sends the index i and they continue with the reduced function 
g (obtained by shrinking Alice's domain to S{), which has rank(g) = rank(Si) < rank(Mf)/2, If Bob 
receives a 0, he checks if his y occurs in some type 2 Rj. If no, then he knows that (x, y) does not occur 
in any Ri, so f(x, y) = 1 and he sends a to Alice to tell her; if yes, then he sends j and they continue 
with the reduced function g, which has rank(g) = rank{Ti) < rank(Mf)/2 because Rj is type 2. Thus 
Alice and Bob either learn f(x,y) or reduce to a function g with rank(g) < rank(f)/2, at a cost of at 
most 1 + log(c + 1) bits. It now follows by induction on the rank that D(f) < (1 + log(C°(/) + 1))(1 + 
logrank(f)). 

Noting that C 1 (J) = C°(f) and \rank{f)-rank(f)\ < 1, we have D(f) = D(f) < (l+log(C°(/) + 
!))(! + log rankQ)) < (1 +log(C 1 (/) + 1))(2 + log rank(f)). □ 



C Proof of Theorem 

Theorem^/) >i^. 

Proof By Lemma [7| we can write the final state of an ^-qubit bounded-error protocol for / as 

i£{0,lY 

Let(f)(x,y) = Z^jeio.i}*- 1 a n( x ) f3ii(y)\An(x)}\l)\Bn(y)) be the part of the final state that corresponds to 
a 1 -output of the protocol. For i,j G {0, 1}^ _1 , define functions dij,bij by 



aij(x) = an(x)a j i{x){An(x)\A j i(x)) 



biM = ^ 1 {y)f3 jl {y){B il (y)\B ]l {y)) 
Note that the acceptance probability is 

P(x,y) = {(j)(x,y)\4>(x,y)) = ^ <Hj{x)bij{y). 

ij'efo.i^- 1 

We have now decomposed P(x, y) into 2 2i ~ 2 functions. However, we must have \P(x, y) — f(x, y)\ < 1/3 
for all x, y, hence 2 2 ^ 2 > fh{f). It follows that £ > (log m(/))/2 + 1. □ 
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D Proof of Theorem g 

Here we prove Theorem 0. The proof uses some tools from the degree-lower bound proofs of Nisan and 



Szegedy [ |27| , Section 3], including the following result from [13[ 31J]: 



Theorem 11 (Ehlich, Zeller; Rivlin, Cheney) Let p be a single-variate polynomial of degree deg{p) such 
that b\ < p(i) < b 2 for every integer < % < n, and the derivative satisfies \p'{x)\ > cfor some real 
< x < n. Then deg{p) > ^cn/(c + b 2 — bi). 

A hypergraph is a set system H C Vow{l, . . . , n}. The sets E G H are called the edges of i?. We call 
H an s-hypergraph if all i? G if satisfy \E\ > s. A set S C {1, . . . , n} is a blocking set for i7 if it "hits" 
every edge: 5 n £ / for all E G H. 

Lemma 8 Let g : {0, l} n — » {0, 1} be a Boolean function for which g(0) = and g(ei) = 1, p be a 
multilinear polynomial which approximates g (i.e. \g(x) — p(x)\ < 1/3 for all x £ {0, l} n ), and H be the 
\JnjYl-hypergraph formed by the set of all monomials of p that have degree > \JnjYl. Then H has no 
blocking set of size < n/2. 

Proof Assume, by way of contradiction, that there exists a blocking set S of H with |5| < n/2. Obtain 
restrictions h and q of g andp, respectively, on n— \S\ > n/2 variables by fixing all S- variables to 0. Then q 
approximates h and all monomials of q have degree < y/n/12 (all p-monomials of higher degree have been 
set to because S is a blocking set for H). Since q approximates h we have q(0) G [—1/3, 1/3], q(ei) G 
[2/3,4/3], and q(x) G [—1/3,4/3] for all other x G {0, l} n . By standard symmetrization techniques [ J231 , 
[27[ ], we can turn q into a single-variate polynomial r of degree < ^/n/12, such that r(0) G [—1/3, 1/3], 
r(l) G [2/3, 4/3], and r(i) G [-1/3, 4/3] for i G {2, . . . , n/2}. Since r(0) < 1/3 and r(l) > 2/3, we must 
have p'(x) > 1/3 for some real x G [0, 1]. But then deg(r) > y / {l/3)(n/2)/(l/3 + 4/3 + 1/3) = y^n/12 
by Theorem [TTJ, contradiction. □ 

The next lemma shows that H must be large if it has no blocking set of size < n/2: 

Lemma 9 If H is an s-hypergraph of size m < 2 s , then H has a blocking set of size < n/2. 

Proof We use the probabilistic method to show the existence of a blocking set S. Randomly choose a set S 
of n/2 elements. The probability that S does not hit some specific E G H is 

("n/f) = f(f-l)...(f-|£| + l) < |s, 

(n/ 2 ) n(n-l)...(n-|£|+l) " 

Then the probability that there is some edge E G H which is not hit by S is 

Pr[ \/ S does not hit E] < ^ Pr[S does not hit E] < Y, 2_|£| < m • 2" s < 1. 

EeH E&H E&H 

Thus with positive probability, S hits all E G H, which proves the existence of a blocking set. □ 

The above lemmas allow us to prove: 
Theorem | // g is a Boolean function, then mon(g) > 2V bs0 (9)/ 12 . 
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Proof Let p be a polynomial which approximates g with rnoh(g) monomials. Let b = bsO(g), and z and 
Si, . . . , Sb be the input and sets which achieve the 0-block sensitivity of g. We assume without loss of 
generality that g(z) = 0. 

We derive a b- variable Boolean function h(y±, . . . , y&) from g{x\, . . . , x n ) as follows: if j G Si then we 
replace Xj in g by j/j, and if j Sj for any i, then we fix Xj in g to the value z^. Note that h satisfies 

1. h(0) = g(z) = 

2. h(ei) = g{z s >) = 1 for all unit e t G {0, l} 6 

3. rnon(h) < moh(g), because we can easily derive an approximating polynomial for h from p, without 
increasing the number of monomials in p. 

It follows easily from combining the previous lemmas that any approximating polynomial for h requires at 
least 2 V b ' 12 monomials, which concludes the proof. □ 
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